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ABSTRACT 

Rao's  work  is  always  inspired  by  some  practical  problems.   It  is 
not  surprising  that  many  of  the  techniques  he  developed  found  their  way 
to  various  applications  in  econometrics.   In  this  review  paper,  we  try 
to  evaluate  the  impact  of  his  pioneering  1948  score  test  on  the  current 
practice  of  econometric  model  specification  tests.   In  so  doing,  we 
relate  Rao's  score  principle  to  some  of  the  earlier  popular  tests  in 
econometrics.   We  also  discuss  the  recent  developments  on  this  topic  and 
problems  for  further  research. 


1 .   INTRODUCTION 

The  history  of  statistical  hypothesis  testing  is  indeed  a  very 
long  one.   However,  if  one  has  to  pick  a  few  papers  that  have  profound 
influence  on  the  current  practice,  the  following  three  papers  must  be 
among  those:   Neyman  and  Pearson  (1928),  Wald  (1943)  and  Rao  (1948). 
These  papers  suggested  three  basic  principles  of  testing,  likelihood 
ratio  (LR) ,  Wald  (W)  and  Rao's  score  (RS),  respectively.   For  a  long 
time  econometricians  have  been  invariably  using  the  LR  test  and  some 
version  of  the  Wald  principle.   Score  test  was  rarely  used  explicitly, 
although  some  of  the  earlier  econometric  tests  could  be  given  score  test 
interpretation.   In  the  late  seventies  and  eighties,  however,  we 
observed  intense  activities  in  the  application  of  score  principle  to  a 
variety  of  econometric  testing  problems  and  studying  the  properties  of 
the  resulting  tests.   And  now  RS  tests  are  the  most  common  items  in  the 
econometricians'  kit  of  testing  tools. 

The  aim  of  this  review  paper  is  very  modest.   Our  main  purpose  is 
to  highlight  the  role  of  RS  test  in  the  current  development  of  model 
specification  and  diagnostic  checks  in  econometrics.   Applications  of  RS 
test  in  econometrics  are  so  vast  and  the  number  of  papers  on  this  topic 
is  so  numerous  that  it  is  an  impossible  task  to  provide  a  balanced  and 
comprehensive  review.   There  are  already  some  survey  papers  and 
monographs  that  cover  RS  test.   In  particular,  we  would  refer  the 
readers  to  Breusch  and  Pagan  (1980),  Engle  (1984),  Kramer  and  Sonnberger 
(1986)  and  Godfrey  (1988).   Most  of  the  recent  textbooks  also  discuss  RS 
test,  for  example,  see  White  (1984,  pp.  72-74),  Amemiya  (1985, 
pp.  141-146),  Judge  et  al.  (1985,  pp.  182-187),  Kmenta  (1986, 
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pp.  493-495),  Spanos  (1986,  pp.  326-336),  Maddala  (1988,  pp.  137-139), 
Green  (1990,  pp.  357-359)  and  Harvey  (1990,  pp.  169-177). 

The  plan  of  the  paper  is  as  follows.   In  the  next  section,  we 
define  the  RS  test  statistic  and  discuss  some  of  its  properties, 
particularly  its  invariance  and  asymptotically  equivalent  forms.   In 
Section  3,  we  mention  some  of  the  popular  applications  in  econometric 
model  evaluation.   Many  of  the  old  and  new  econometric  tests  could  be 
given  score  test  interpretation,  and  these  are  mentioned  in  Section  4. 
There  are  several  advantages  of  RS  test  compared  to  the  LR  and  W  tests 
and  Section  5  discusses  that  aspect.   There  still  remain  many  unsolved 
problems.   In  Section  6,  we  provide  a  brief  review  of  recent 
developments  and  mention  some  problems  for  further  research.   Last 
section  concludes  the  paper  with  a  few  remarks. 

2.   SCORE  TEST  AND  ITS  PROPERTIES 
Suppose  there  are  n  independent  observations,  y^y,/  . . .  ,y_  with 
identical  density  function  f(y;9)  where  6  is  a  p  x  l  parameter  vector 
with  8e©  c  rP.   it  is  assumed  that  f(y;6)  satisfies  the  regularity 
conditions  stated  in  Rao  (1973,  p.  364)  and  Serfling  (1980,  p.  144). 
The  log-likelihood  function,  the  score  vector  and  the  information  matrix 
are  then  defined,  respectively,  as 

{(6)  =  T    in  f(yitB) 

i-l 


4(6)  =  a«6) 


ae 

and 


7(6)  =  -,  »™ 

ODOD 


The  hypothesis  to  be  tested  is  HQ:   h(6)  =  c  where  h(6)  is  an 
r  x  l  vector  function  of  6  with  r  <  p  and  c  is  a  known  constant  vector. 
It  is  assumed  that  H  =  H(8)  =  dh(0)/80  has  full  column  rank,  i.e.,  rank 
[H(9)]  =  r.   The  RS  statistic  for  testing  H  can  be  written  as 

RS  =  d'(B)  J(6)-1d(6)  (2-1) 

where  """  indicates  that  the  quantities  have  been  evaluated  at  the 
restricted  maximum  likelihood  estimate  (MLE)  of  0,  say  6. 

The  idea  behind  this  test  is  that  if  H  is  true,  d(0)  is  expected 

to  be  close  to  zero  by  virtue  of  the  fact  that  d(0)  =  0,  where  0  is  the 
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unrestricted  MLE  of  6.   Under  H  ,  RS  is  asymptotically  distributed  as  % 

[see,  for  example,  Rao  (1973,  pp.  418-419),  Serfling  (1980,  pp.  156-160) 

and  Godfrey  (1988,  pp.  13-15)].   Econometricians  use  the  above  score 

form  of  the  test.   However,  it  is  often  called  as  the  Lagrange 

multiplier  (LM)  test.   The  terminology  LM  test  came  from  the  two 

articles  Aitchison  and  Silvey  (1958)  and  Silvey  (1959),  where  an  LM 

interpretation  of  (2.1)  was  given. 

Note  that  0  can  be  obtained  from  the  solution  to  the  equations 


d(fl)  -  H(B)X   =  0 
and 

h(B)    =  c. 
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where  X's   are  the  Lagrange  multipliers.   Therefore,  we  have 
d(6)  =  H(6)X.   Given  that  H(6)  has  full  rank,  d(6)  =  0  is  equivalent  to 
X  =  0,  i.e.,  the  Lagrange  multipliers  vanish.   These  multipliers  can  be 
interpreted  as  the  implicit  cost  (shadow  prices)  of  imposing  the 
restrictions.   It  can  be  shown  that 


T  .  dt  (8) 


that  is,  the  multipliers  give  the  rate  of  change  of  the  maximum 
attainable  value  with  respect  to  the  change  in  the  constraints.   If 
H  :  h(0)  =  c  is  true  and  £(6)  gives  the  optimal  value,  X   should  be 
close  to  zero.   Given  this  "economic"  interpretation  in  terms  of 
multipliers,  it  is  not  surprising  that  the  econometricians  prefer  the 
term  LM  rather  than  RS.   In  terms  of  Lagrange  multipliers,  (2.1)  can  be 
expressed  as 

lm  =  X'iT(B)  l(B)-lH(B)X  <2-2> 

It  is  clear  from  (2.1)  and  (2.2)  that  RS  form  of  the  test  is  much  easier 
to  compute,  and  this  explains  its  popularity  among  econometricians. 
However,  as  explained  above,  they  call  it  by  a  name  which  is  closer  to 
economists'  way  of  thinking. 

From  (2.1)  it  is  clear  that  the  RS  statistic  is  essentially  a 
distance  measure  between  the  null  and  alternative  hypotheses,  and  it  can 
be  given  Mahalanobis  norm  interpretation  [see  Del  Pino  (1990)].   This 
leads  to  the  development  of  alternative  forms  of  RS  test  by  using 


different  types  of  distance  measures  [see  Ullah  (1989)].   Note  that  the 

RS  statistic  does  not  depend  on  the  alternative  hypothesis  explicitly, 

since  it  uses  the  slope  of  j2(6)  at  6  =  6.   There  may  be  different 

likelihood  function  (having  the  same  slope  and  possibly  the  same 

curvature  at  H  )  that  will  lead  to  the  same  RS  statistic.   This 
o' 

phenomenon  is  called  the  invariance  property  of  the  RS  test.   There  are 
many  examples  of  this,  but  here  we  mention  only  a  few.   The  RS  statistic 
for  testing  normality  suggested  in  Bera  and  Jargue  (1981)  and  Jargue  and 
Bera  (1987)  with  Pearson  family  of  distributions  as  the  alternative, 
remains  unchanged  under  Gram-Charlier  (type  A)  alternatives  [see  Bera 
(1982a,  p.  98)].   Statistics  for  testing  homoskedasticity  are  invariant 
to  different  forms  of  alternatives  such  as  multiplicative  and  additive 
heteroskedasticities  as  has  been  noted  by  Breusch  and  Pagan  (1979)  and 
Godfrey  and  Wickens  (1982).   Testing  serial  independence  against  the 
p-th  order  autoregressive  [AR(p)]  or  the  p-th  order  moving  average 
[MA(p) ]  processes  lead  to  the  same  test  statistic,  see  for  instance 
Breusch  (1978)  and  Godfrey  (1978a).   Pesaran  (1979)  found  that  the  RS 
test  does  not  differentiate  between  polynomial  and  rational  distributed 
lags. 

These  examples  raise  the  guestion  whether  the  RS  test  will  be 
inferior  to  other  asymptotically  eguivalent  procedures  such  as  the  W  and 
LR  tests,  with  respect  to  power,  since  it  does  not  use  precise 
information  of  the  alternative  hypothesis.   However,  the  Monte  Carlo 
results  of  Godfrey  (1981)  and  Bera  and  McKenzie  (1986)  suggest  that 
apparently  there  is  no  setback  in  the  performance  of  the  RS  test 
compared  to  the  LR  test.   Using  locally  eguivalent  alternative  (LEA) 
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models,  Godfrey  and  Wickens  (1982)  provide  some  theoretical 
underpinnings  behind  the  invar iance  property  of  the  RS  test.   Their 
analyses  also  highlight  the  fact  that  a  proper  choice  of  the  alternative 
sometimes  greatly  reduces  the  computational  complexity  of  a  test.   The 
invariance  property  also  cautions  researchers  against  automatically 
accepting  the  alternative  hypothesis  when  the  null  is  rejected.   That 
is,  if  we  reject  the  null,  it  does  not  imply  that  the  alternative  that 
has  been  used  to  derive  the  test  is  correct. 

We  can  calculate  the  variance  of  d(6)  in  a  number  of  ways  which 
are  asymptotically  the  same.   This  leads  to  different  versions  of  the  RS 
test  with  different  properties  in  small  samples.   In  fact,  in  the 
general  formula  (2.1),  if  we  substitute  any  positive  definite  matrix 
A(0)  in  place  of  1(6)  such  that 


plim 


*<fl>-(w) 


=  h 


where  I  is  a  p  x  p  identity  matrix,  then  all  the  statistics  of  the  form 
d' (6)A(6)  *1d(0)  will  be  asymptotically  equivalent.   Two  immediate 
choices  of  A(8)  are 

Ar(6)   =  E[d(B)   •  d'<8)]  (2.3a) 

and 

ve>.*[-wl  (2'3b> 
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If  the  specified  probability  model  is  correct  then  A.,  (6)  =  ^(6)  [see 
White  (1982)].   In  that  case  either  A.,  (6)  or  A2(6)  could  be  used,  and  we 
will  denote  the  statistic  as  RS(WE),  "WE"  stands  for  "with  expectation" 
using  the  terminology  in  Bera  and  McKenzie  (1986). 

In  certain  situations,  it  may  be  difficult  to  calculate  the  above 
expectations  analytically,  for  example,  when  testing  linear  and 
log-linear  models  using  Godfrey  and  Wickens  (1981)  approach.   In  such 
cases  we  can  use  the  Hessian  form 

A3(6)  =  -  **M  (2.4a) 

or  the  outer  product  form 

A^e)    =  G(6)G'(8)  (2.4b) 

where  G(6)  has  typical  (i,j)-th  element  as  d£-(B) /d&-,    (i  =  l,2,...,n; 
j  =  l,2,...,p)  with  £-  =  lnf(y.,0).   When  AA(8)  is  used,  noting  that 
d(6)  =  G'(9)l,  the  RS  statistic  can  be  computed  as 


RS   =  l'G(fl)  [G'  (9)  G(0)  ]  -XG'  (9)1 

(2.5) 

=  nR2 


where  1,  is  an  n  x  l  vector  of  ones  and  R2  is  the  uncentered  coefficient 
of  determination  of  regressing  1  on  G(6)  (see  Godfrey  and  Wickens 
(1981)].   Bera  (1982b)  argued  that  this  statistic  can  be  written  in 
Hotelling's  T2  form,  and  for  finite  samples  it  is  approximately 
distributed  as  an  nr/(n  -  r  +  1)  multiple  of  F(r,n  -  r  +  1)  under  HQ. 
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For  future  reference,  we  will  denote  this  form  as  RS(WOE)  to  mean 
"without  expectation."   We  note  here  that  A1 ,  A2,  A,  and  A,  are  based  on 
Shanon's  entropy.   Other  choices  of  A's,  and  hence  the  variants  of  RS 
test  can  be  developed  by  using  various  measures  of  entropy  [see  Uliah 
(1989) ] . 

Davidson  and  MacKinnon  (1984a)  have  proposed  an  alternative 
version  of  RS  test  that  may  improve  the  small  sample  properties.   It  can 
also  be  obtained  by  running  a  regression.   As  their  regression  is  based 
on  2n  observations,  it  is  called  RS(DLR)  to  emphasize  the  nature  of 
double  length  regression. 

All  the  above  versions  are  based  on  the  assumption  that  the 
underlying  probability  model  is  correctly  specified.   When  this 
assumption  fails,  the  above  versions  of  RS  test  will  not  have  correct 
size  even  asymptotically.   To  overcome  this  problem  White  (1982) 
proposed  using  [see  also  Kent  (1982)] 

A5<8)  =  A3  (6)  -^(6)^3  (6)  -1  (2-6) 

White's  yersion  can  also  be  expressed  in  nR  form,  where  R  is  the 
uncentered  coefficient  of  determination  of  the  regression  of  a  unit 
vector  on  G(0)A,(6)  "1H(6)  [see  Bera  and  McKenzie  (1986)]. 

If  the  maintained  hypothesis  is  correct  all  the  above  forms  are 
asymptotically  equivalent.   The  asymptotic  equivalence  of  the  tests  is 
not  necessarily  indicative  of  their  finite  sample  behavior.   Using  A3(6) 
sometimes  might  lead  to  negative  values  of  the  test  statistic  in  small 
sample  and  also  this  version  will  not  be  invariant  to  units  of 


measurement.   Moreover,  1(B)    contains  fewer  stochastic  terms  than  Aj(6) 
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and  therefore,  RS(WE)  can  be  expected  to  converge  to  %      faster  than 

RS(WOE)  under  HQ.   On  the  other  hand  Efron  and  Hinkley  (1978)  argued 

that  A3(9)  and  A4(6)  are  "closer  to  the  data"  than  J(0)  and  should  be 

favored.   Davidson  and  MacKinnon  (1983)  and  Bera  and  McKenzie  (1986) 

examined  the  small  sample  performances  of  various  versions  of  RS 

statistic.   Their  recommendations  were  to  use  RS(WE)  or  RS(DLR).   In 

certain  cases  RS(WE)  is  difficult  to  obtain  whereas  RS(DLR)  can  be 

easily  calculated.   An  example  is  the  problem  of  testing  linear  model 

within  the  Box-Cox  transformation  framework.   There  are  also 

circumstances  in  which  RS(DLR)  is  not  applicable  such  as  testing  for 

misspecif ication  in  certain  limited  dependent  variable  models.   When 

both  RS(WE)  and  RS(DLR)  are  available,  either  of  them  can  be  used  since 

there  is  not  much  difference  between  these  two  versions  of  RS  statistic 

in  terms  of  small  sample  size  and  power. 

One  of  the  most  interesting  properties  of  the  RS  test  is  its 

additivity  in  the  sense  that  the  RS  statistic  for  testing  two  (or  more) 

hypotheses  jointly  can  be  decomposed  into  sum  of  the  RS  statistics  for 

testing  the  hypotheses  individually.   This  property  was  first  noted  by 

Pesaran  (1979).   He  found  that  the  RS  test  of  the  deterministic  and 

stochastic  parts  of  a  dynamic  linear  regression  model  can  be  written  as 

the  sum  of  two  independent  parts.   In  a  more  complex  situation,  Bera  and 

Jargue  (1982)  showed  that  the  joint  RS  test  for  normality, 

homoskedasticity,  serial  independence  and  functional  form  is  the  sum  of 

the  standard  RS  tests  for  each  component  of  the  joint  hypothesis.   More 

recently,  Higgins  and  Bera  (1989)  suggested  a  simultaneous  test  for 
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autoregressive  conditional  heteroskedasticity  (ARCH)  and  bilinearity 
which  is  just  the  sum  of  individual  tests  for  ARCH  and  bilinearity. 

Let  us  assume  that  HQ  :  h(0)  =  c  naturally  partitions  into  two 
separate  sets  of  restrictions,  H.  :  h.(6)  =  c.  and  H„  :  h2(6)  =  c2  with  a 
corresponding  partition  for  H(6)  =  dh(6)/d6  as  H  =  [H1  :  H2].   For  a 
test  principle  T,  denote  by  TAB  the  statistic  for  simultaneously  testing 
both  sets  of  restrictions  HA  and  H_.   Let  T.  denote  the  statistic  for 
testing  HA  with  the  H_  restrictions  imposed,  and  similarly  T„  is 
defined.   With  respect  to  the  test  principle  T,  the  tests  for  hypotheses 
HA  and  Hg  are  said  to  be  additive  if  TAB  =  TA  +  Tg.   Bera  and  McKenzie 
(1987)  showed  that  a  necessary  and  sufficient  condition  for  additivity 
of  the  RS  tests  for  the  hypotheses  HA  and  H„  is  that 
H1'(fl)I(6)-1H2(6)  =  0. 

Aitchison  (1962)  introduced  the  concept  of  separability;  if  two 
hypotheses  are  separable  and  the  sample  size  is  large,  while  testing  one 
hypothesis  we  may  be  able  to  ignore  the  other  hypothesis.   That  is,  the 
test  is  "robust"  to  whether  the  other  hypothesis  is  true  or  not.   This 
idea  is  very  much  related  to  Stein's  (1956)  "adaptive"  test.   Results  of 

Bera  and  McKenzie  (1987)  imply  that  separability  of  the  RS  test  also 

« 

leads  to  its  additivity.   However,  for  the  LR  test  this  is  true  only 
asymptotically  and  for  the  W  test  the  result  does  not  hold  in  general. 
After  carrying  out  one-directional  RS  tests,  when  additivity  applies,  a 
joint  test  can  be  obtained  by  simply  adding  up  the  component  statistics. 
This  provides  an  optimal  way  to  combine  different  diagnostic  tests  of 
econometric  models  [see  Pagan  and  Hall  (1983,  p.  152)  and  Pagan  (1984, 
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p.  126) ] .   For  the  LR  and  sometimes  for  W  tests,  such  an  operation  is 
valid  only  asymptotically. 

3.   SOME  SPECIFIC  APPLICATIONS  OF  RS  TESTS  TO 
ECONOMETRIC  MODEL  EVALUATION 

Byron  (1968)  was  probably  the  first  to  introduce  the  RS  test  to 
the  econometrics  literature.   He  used  Silvey's  LM  version  for  testing 
linear  restrictions  in  demand  system.   However,  we  had  to  wait  more  than 
a  decade  to  realize  the  potential  of  the  RS  test  in  econometrics.   The 
earlier  notable  papers  were  Savin  (1976),  Berndt  and  Savin  (1977), 
Breusch  (1978,  1979)  and  Godfrey  (1978a, b,c).   Possibly,  Breusch  and 
Pagan  (1980)  has  been  the  most  influential.   It  presented  the  RS  test  in 
a  general  framework  in  the  context  of  econometric  model  evaluation  and 
discussed  many  applications.   Engle's  (1984)  survey  provided  excellent 
progress  report  up  to  early  eighties.   Now  there  is  a  full  length 
monograph,  Godfrey  (1988)  which  gives  a  very  comprehensive  account  of 
most  of  the  available  RS  tests  in  econometrics.   And  there  is  no  doubt 
that  in  coming  years  a  few  more  survey  papers  and  monographs  will  be 
written.   In  this  section,  we  do  not  make  any  attempt  to  provide  a 
complete  list  of  all  applications  of  the  RS  test  in  econometrics.   We 
just  mention  some  of  the  commonly  used  RS  tests  and  discuss  their  links 
with  the  older  tests. 

One  of  the  oldest  test  in  statistics  and  econometrics  is  the 
Pearson  x2  goodness-of-f it  test.   Using  a  multinomial  likelihood 
function  it  can  be  easily  shown  that  this  is  a  RS  test  [see  Rao  (1973, 
p.  442)].   The  analysis  of  classical  linear  regression  model 
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yt   =  x'ifi  *   Cj,   i  =  1,2 /3  (3-1) 

is  often  based  on  four  basic  assumptions,  correct  linear  functional  form 
and  the  assumptions  of  disturbance  normality,  homoskedasticity  and 
serial  independence.   Violation  of  these  affects  both  the  estimation  and 
inference  results.   Until  recently  these  basic  assumptions  were  not 
tested  thoroughly.   With  the  aid  of  RS  principle,  numerous  procedures 
have  been  proposed  to  test  the  above  assumptions  and  these  are  now 
routinely  reported  in  most  of  the  standard  econometric  packages  such  as 
PC-GIVE,  MICROFIT  and  SHAZAM.   Just  to  name  some  of  the  RS  test 
procedures,  test  for  normality  has  been  derived  by  Bera  and  Jarque 
(1981)  and  Jarque  and  Bera  (1987),  Godfrey  (1978c)  and  Breusch  and  Pagan 
(1979)  proposed  tests  for  homoskedasticity  and  Breusch  (1978)  and 
Godfrey  ( 1978a, b)  developed  tests  for  serial  independence.   The  older 
tests  like  Durbin  and  Watson  (1950),  Chow  (1960),  Goldfeld  and  Quandt 
(1965),  Durbin  (1970)  could  also  be  given  RS  test  interpretation. 

To  see  the  attractiveness  of  the  RS  test,  let  us  briefly  consider 
the  test  for  normality.   Bera  and  Jarque  (1981)  started  with  the  Pearson 
family  of  distributions.   Given  the  complexity  of  the  ML  estimation  of 
the  Pearson  family,  W  and  LR  tests  are  ruled  out  from  a  practical  point 
of  view.   However,  the  RS  test  is  based  on 


RS  =  n 


yst  ♦  &  - 3)2  (3.2) 

6         24 


n 


where  VbT  =  /j3//A/2,  b2  =  Pa/^  and  ^'  =   2   s./n  where  e-  =  y-  -  x .  B  are 

i=l 
the  OLS  residuals.   It  turns  out  that  this  test  was  also  mentioned  by 
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Bowman  and  Shenton  (1975)  and  was  hardly  used  in  statistical  work  due  to 
lack  of  theoretical  justification.   Now  we  note  that  RS  approach 
uncovers  a  principle  that  proves  the  asymptotic  efficiency  of  the  test 
in  (3.2).   As  we  mentioned  earlier  if  Gram-Charlier  (Type-A)  expansion 
is  used  instead  of  the  Pearson  family  as  the  alternative  distribution, 
the  same  test  would  result.   Furthermore,  once  the  test  statistic  or 
just  the  score  vector  is  derived,  for  the  RS  test,  the  alternative 
hypothesis  is  irrelevant.   The  above  test  is  based  on  two  moments,  third 
and  fourth.   One  could  have  started  with  these  moments  directly  without 
going  through  the  derivation.   In  that  sense  RS  test  has  moment  test 
interpretation  [see  Newey  (1985),  Tauchen  (1985)  and  Pagan  and  Vella 
(1989) ]. 

As  mentioned  previously,  the  first  application  of  RS  test  to 
econometrics  was  in  testing  linear  restriction  in  the  demand  system 
[Byron  (1968)].   The  null  hypothesis  can  be  stated  as  H  :  RB  =  c  for 
the  model  (3.1)  where  R  is  a  known  constant  matrix.   Berndt  and  Savin 
(1977)  showed  for  this  problem,  under  the  assumption  of  normality  of  c, 
W  >  LR  >  RS  [see  also  Rothenberg  (1984)].   This  implies  that  if  we  use 
the  same  critical  value  for  the  three  statistics,  it  might  result  in 
conflicting  inference.   Geweke  (1981)  reached  similar  conclusion  by 
comparing  the  Bahadur  slopes  of  these  three  tests  for  linear  restriction 
in  the  regression  model  with  nonscalar  covariance  matrix  [also  see  Magee 
(1987)].   When  V(c)  =  Ino2,  W  is  a  linear  function  of  the  F-test  and 
exact  critical  values  for  all  the  three  tests  can  be  obtained  [see  Bera 
(1982c)].   Ullah  and  Zinde-Walsh  (1984)  studied  the  properties  of  the 
three  test  statistics  particularly  the  above  inequalities  when  the 
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disturbance  term  follows  a  multivariate  t-distribution.   They  found  that 
although  there  was  no  change  in  the  LR  statistic,  the  inequalities  were 
no  longer  true.   The  relative  magnitude  of  the  degree  of  freedom  of  the 
t-distribution  with  respect  to  the  sample  size  was  the  crucial  factor. 
Ullah  and  Zinde-Walsh  (1985)  and  Zinde-Walsh  and  Ullah  (1987) 
generalized  these  results  to  spherically  symmetric  and  elliptical  error 
distributions,  respectively. 

Apart  from  its  local  asymptotic  efficiency,  one  of  the  major 
attractions  of  the  RS  test,  as  we  stressed  before,  is  its  computational 
simplicity.   However,  with  the  current  stage  of  computing,  this  of 
course  is  not  a  decisive  factor.   What  is  attractive  is  that  in  most 
cases  we  can  represent  the  RS  test  simply  as  a  variable  addition  test  in 
the  context  of  regression  model.   To  demonstrate  this  through  a  simple 
case,  let  us  consider  a  simplified  version  of  the  Durbin  (1970)  example. 
Suppose  we  have  the  following  regression  model 

yi  =  a-Yi.x   +  XiP  +  ux,       i  =  1,2,  ...  ,a 

Ui    ■    02Ui-l    +    Ei'  l02l<l 

« 
ei  -  IIDN(0,a2) 

and  we  are  interested  in  testing  HQ  :  62  =  0.   There  are  a  number  of 
ways  to  carry  out  the  RS  tests.   One  simple  way  is  to  test  significance 
of  the  coefficient  of  u-  .  in  the  following  artificial  regression 


Yi  =  ayi-i  +  *iP  +  Mi-i  +  wi  (3,3) 
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where  u{'s  are  the  OLS  residuals  [see  Godfrey  and  Wickens  (1982)].   It 

is  clear  from  (3.3)  that  the  test  is  based  on  62  =  (s  u,u,-i/s  "i-i)'  an 

i       i 
ML  type  estimate  of  0,.   There  are  many  instances  where  RS  tests  can  be 

performed  by  adding  certain  extra  variables  to  the  original  regression 

model  and  examining  the  significance  of  these  extra  regressors.   For 

more  on  variable  addition  tests  and  artificial  regression  approach  to 

testing,  see  Pagan  (1984)  and  MacKinnon  (1990). 

Tests  developed  for  the  standard  regression  model  have  been 

generalized  to  testing  for  the  limited  dependent  variable,  discrete 

choice,  disequilibrium  and  simultaneous  equation  models.   Specification 

tests  are  more  important  for  these  models  since  violation  of  ideal 

conditions  have  more  serious  consequences  for  these  specialized 

econometric  models.   A  large  number  of  papers  discuss  the  details  of  RS 

test  for  these  models  [for  example,  see  Jarque  and  Bera  (1982),  Bera, 

Jarque  and  Lee  (1984),  Davidson  and  MacKinnon  (1984b),  Engle  (1984),  Lee 

and  Maddala  (1985),  Robinson,  Bera  and  Jarque  (1985),  Cameron  and 

Trivedi  (1986),  Moon  (1988),  Bera  and  Robinson  (1989)  and  Pagan  and 

Vella  (1989)].   Recently,  Pagan  and  Ullah  (1990,  ch.  6)  extended  the 

variable  addition  approach  to  RS  tests  for  the  case  where  the  linear 

part  in  (3.1)  is  replaced  by  a  nonparametric  regression.   Finally,  using 

the  RS  principle,  Anselin  ( 1988a, b)  developed  several  diagnostics  for 

assessing  model  misspecif ication  due  to  spatial  dependence  and 

heterogeneity . 
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4.   SCORE  INTERPRETATION  OF  SOME  ECONOMETRIC  TESTS 

One  of  the  common  problems  in  econometrics  involves  testing  two 
economic  models  which  are  non-nested,  in  the  sense  that  one  is  not  a 
special  case  of  other,  or  one  cannot  be  obtained  from  the  other  under 
some  limiting  values  of  the  parameters.   For  this  problem  standard  LR 
test  is  not  applicable.   Cox  (1961,  1962)  suggested  a  test  which  is 
based  on  the  ratio  of  maximized  loglikelihoods  less  an  estimate  of  its 
expected  value  under  the  tested  hypothesis.   Breusch  and  Pagan  (1980, 
p.  248)  made  the  first  attempt  to  relate  the  Cox  approach  to  RS  test. 
However,  there  was  some  confusion  as  indicated  in  Pesaran  (1982),  due  to 
the  fact  that  under  the  tested  hypothesis  parameters  of  the  alternative 
model  are  not  identified  within  the  comprehensive  model  framework.   This 
is  the  familiar  Davies  (1977,  1987)  problem.   Dastoor  (1985)  clearly 
showed  that  Cox  test  is  essentially  a  RS  test  if  the  parameters  of  the 
alternative  hypothesis  are  replaced  by  their  estimates  before  applying 
the  score  principle. 

During  the  sixties  when  econometrics  was  rather  in  a  wilderness 
stage,  Durbin  (1970)  solved  some  of  the  puzzles  in  model  testing, 
particularly  the  use  of  Durbin-Watson  test  in  the  presence  of  lagged 
dependent  variables.   Suppose  we  partition  G  as  6  =  [6^  :  6^]'  and  want 
to  test  H  :  8,  =  6-  . 

0      Z  20 

Let  us  define 

6X  =  MLE  of  B1  given  62  =  62o 
and 
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82  =  MLE  of  82  given  6t  =  6X 

Durbin's    (1970)    test    is   based  on    (62   -   62o)  .      However,    we   can 
write 


mBx.B2)        dt(Blte2o)        gttMgj 

^2  ^2  ae2ae2 


where  6   lies  in  between  82  and  62o.   Since  in  large  sample  the  Hessian 
matrix  can  be  taken  to  be  nonsingular,  a  test  based  on  (62  -  62o)  is 
asymptotically  equivalent  to  a  test  that  uses  d£ (61 ,  62o) /362-   This 
latter  quantity  is  the  score  vector  with  respect  to  92  and  the  RS  test 
for  HQ  :  62  =  82o  uses  exactly  this. 

Hausman  (1978)  test  is  based  on  a  simple  principle  [see  also 
Durbin  (1954)].   Assume  that  under  the  null  hypothesis  of  no 
misspecif ication,  the  consistent,  asymptotically  normal  and  efficient 
estimator  of  6  is  6,  which  under  the  alternative  hypothesis  of 
misspecif ication  is  inconsistent.   Suppose  there  is  another  estimator,  6 
which  is  consistent  (with  the  same  rate  of  convergence  as  6)  under  the 
null  as  well  as  under  the  alternative.   Then  the  Hausman  test  considers 
the  quantity  u  =  6  -  6 .   {It  is  also  interesting  to  note  that  Hausman 's 
computation  of  the  asymptotic  variance  of  /no  as  V(V~n6)  -  V(/n6)  under 
the  null  is  based  on  a  theorem  by  Rao  (1973,  p.  317)].   Let  us  now 
consider  a  special  case  of  testing  H  :  62  =  6-.  where  0  =  (8^,82)'.   We 
use  the  same  notations  as  before.   In  particular,  61  is  the  MLE  of  81 
given  62  =  62o  and  6  =  (6^,62)'  is  the  unrestricted  MLE.   As  interpreted 
by  Holly  (1982),  Hausman 's  test  is  based  on  comparing  the  two  estimates 
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61  and  91#   Since  61  is  efficient  under  H  but  usually  inconsistent  under 
H1 ,  while  61  is  consistent  under  HQ  as  well  as  H. ,  they  are  obvious 
candidates  to  use  in  constructing  the  Hausman  test,  which  is  based  on 
61  -  6,.   It  can  be  shown  that  (see  Ruud  (1984)  and  Godfrey  (1988, 
P-  34)] 


euo   e2o)      yi^.fl,) 


Therefore,  the  Hausman  test  has  score  interpretation.   However,  note 
that  in  (4.1)  the  score  is  with  respect  to  the  nuisance  parameter  G1  and 
it  is  evaluated  at  6.  not  6..   Here  (fi^Op  )  can  be  viewed  as  consistent 
estimator  for  6  under  the  null  hypothesis.   When  consistent  (rather  than 
efficient)  estimators  are  used  there  is  another  attractive  way  to 
construct  a  score  type  test  which  is  due  to  Neyman  (1959,  1979).   In  the 
literature  this  is  known  as  C(a)  or  effective  score  or  Neyman-Rao  test 
[see  Hall  and  Mathiason  (1990)].   For  this  test  we  need  /n-consistent 
estimator.   To  avoid  confusion  let  us  denote  /n-consistent  estimator  of 
6  under  HQ  :  62  =  62o  as  6+  =  (6"!j' ,6^) '  .   Then  Neyman-Rao  C(cr)  test  uses 
the  effective  score 


a^51  .  wi  _  Iiimi-im  ffi  ,4.2, 


where  J.,  are  the  appropriate  blocks  of  the  information  matrix  1(6) 
corresponding  to  6.  and  02.   It  is  clear  that  when  6  is  used  in  place  of 
6+  in  (4.2),  d£(Q)/ddy    =   0  and  the  test  reduces  to  the  standard  RS  test. 
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In  a  sense,  d*£(0)/802  is  the  residual  score  obtained  from  the  residual 

of  running  the  regression  of  3^(0) /302  on  d£-(Q)  /dQ^.      Therefore, 

d*£(&) /362  is  the  part  of  score  which  is  orthogonal  to  the  score  for  01 

[see  Neyman  (1979)  and  Hall  and  Mathiason  (1990)].   From  a  practical 

point  of  view  NR  test  is  very  attractive,  firstly  all  we  need  is 

/n-consistent  estimator  and  secondly  the  test  can  be  viewed  as  adaptive 
in  the  sense  of  Stein  (1956)  because  the  test  is  independent  of  the 

value  of  nuisance  parameter  Q^.      This  Neyman-Rao  orthogonalization 

procedure  has  wider  implications.   Recently  Bera  and  Yoon  (1990)  have 

applied  this  approach  to  develop  tests  which  are  valid  under 

misspecified  alternatives. 

In  Section  2,  we  noted  that  when  the  model  is  correctly  specified 

then  A.,(0)  =  A2(0)  [see  equations  in  (2.3)].   This  equality  can  be  used 

to  construct  a  test  for  model  specification.   White  (1982)  suggested  a 

specification  test  which  examines  whether  the  elements  of  A.,  (6)  -  A2(Q) 

are  close  to  zero  where  6  now  denotes  quasi-MLE.   This  test  is  known  as 

information  matrix  (IM)  test  since  it  exploits  the  IM  equality. 

Consider  a  vector  s(y;0)  defined  by 


s(y;0)  =  vech 


&lnf(y;Q)         dlnf{y,Q)     dlnf(y;6) 

-v\  "\/i »  '  -^  55"* 

OXjOX)  OX)  OX) 


where  "vech"  denotes  an  operator  that  stacks  distinct  elements  of  a 
symmetric  matrix.   Thus  s(y;0)  is  a  vector  with  p(p+l)/2  elements, 
generally  referred  to  as  indicators.   The  actual  test  utilizes  a  sample 
average  of  s(y,0),  namely 
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5(6)  =  -  V  s(yi#G) 

72  i-l 

If  the  model  is  correctly  specified  s(6)  should  take  small  values.   IM 
test  has  found  many  applications  in  econometrics,  for  example,  see  Hall 
(1987)  and  Bera  and  Lee  (1990).   Chesher  (1984)  provided  a  very 
interesting  interpretation  of  the  IM  test.   He  considered  the  problem  of 
testing  for  random  parameters  and  assumed  that  6  has  mean  say  8~  and 
variance  r,  a  p  x  p  matrix.   The  null  hypothesis  that  0  is  nonstochastic 
is  equivalent  to  H  :  r  =  0.   For  r  close  to  zero,  the  marginal  density 
of  y  can  be  approximated  by 

f(y;B,T)   =  f(y;B)  [l  -  tr{(F2   +  F^T}] 

where  F1  =  dlnf/39  and  F2  =  32lnf/d0d6'  .   It  is  clear  that  under 
H  :  T  =  0,  f*(y;6,0)  =  f(y;6).   Chesher  showed  that  the  score  vector 
for  testing  H  is  precisely  equal  to  ns(6).   Thus  the  IM  test  has  a 
score  test  interpretation. 

5.   ADVANTAGES  OF  THE  RS  TEST 
Apart  from  its  computational  simplicity  and  local  asymptotic 
efficiency,  RS  test  has  advantages  from  some  other  theoretical  and 
practical  points  of  view.   For  example,  consider  the  case  of  testing 
when  under  the  null  hypothesis  the  parameter  values  lie  on  the  boundary 
of  the  parameter  space.   In  this  context,  the  standard  theory  associated 
with  tests  based  on  MLE ' s  will  not  be  valid.   Large  sample  properties  of 
MLE's  and  the  associated  tests  in  boundary  situations  have  been  examined 
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by  Chernoff  (1954),  Moran  (1971),  Chant  (1974)  and  Self  and  Liang 
(1987).   One  important  general  result  from  their  investigation  is  that 
the  W  and  LR  tests  in  the  boundary  situation  will  not  follow  their  usual 
asymptotic  x2-<*istribution  while  the  asymptotic  properties  of  the  RS 
test  are  not  altered.   As  a  result  it  has  been  argued  that  the  RS  test 
is  particularly  suitable  for  testing  hypotheses  under  which  parameter 
values  are  at  the  boundary  of  the  parameter  space  [see  Godfrey  (1988, 
p.  95)]. 

Sometimes  the  W  test  might  have  some  computational  advantages. 
For  models  in  which  the  null  hypothesis  imposes  nonlinear  restrictions 
on  8,  the  unrestricted  MLE  6  is  easier  to  calculate.   Examples  of  such 
cases  can  be  found  in  the  dynamic  specification  test  of  Sargan  (1980), 
the  rational  expectation  hypothesis  test  of  Wallis  (1980)  and  Hoffman 
and  Schmidt  (1984)  and  the  test  of  market  efficiency  under  rational 
expectation  as  described  in  Baillie,  Lippens  and  McMahon  (1983). 
Although  the  W  test  might  be  easier,  particularly  in  this  kind  of 
situation  it  runs  into  a  serious  problem.   Gregory  and  Veal  (1985) 
pointed  out  that  the  numerical  value  of  the  W  statistic  is  not  invariant 
to  the  algebraically  equivalent  forms  of  the  null  hypothesis.   However, 
the  RS  test  is  invariant  to  different  equivalent  forms  of  nonlinear 
restrictions.   The  main  problem  with  the  W  test  is  that  it  uses  a  wrong 
"metric"  and  is  not  invariant  to  changes  in  co-ordinates  [see  Critchley, 
Marriott  and  Salmon  (1990)  and  Davidson  (1990)]. 

Finally,  let  us  consider  the  problem  of  testing  HQ  :  62  =  62o  when 
the  density  function  depends  on  6  =  (Q^,Q1)'  .      Suppose  given  a  value  of 
8.J,  we  can  find  a  test  statistic  for  testing  H  .   In  many  econometric 
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problems  it  happens  that  when  H  is  true  the  model  is  free  of  6.. .   In 
other  words,  the  nuisance  parameter  6-  is  identified  only  under  the 
alternative  hypothesis.   This  implies  under  H  ,  the  information  matrix 
is  singular  thus  invalidating  the  standard  test  procedures.   However, 
using  the  results  of  Davies  (1977,  1987),  it  is  easy  to  modify  the  RS 
test  to  obtain  a  valid  test  procedure.   Suppose  given  6.,,  RS  statistic 
for  testing  H  is  given  by  RS(0.).   Davies  approach  appeals  to  the 
"union-intersection  principle"  of  Roy  (1953)  and  suggest  basing  the  test 
on  a  critical  region  of  the  form 


j  sup    RSiBJ     >  k  1 

k  J 


where  k  is  chosen  to  have  an  appropriate  size  of  the  test.   In  practice 
this  method  works  very  well  as  has  been  demonstrated  by  Bera  and  Higgins 
(1990) . 

6.   SOME  RECENT  DEVELOPMENTS 
To  provide  an  intuitive  idea  behind  the  score  test,  earlier  we 
noted  that  if  H  is  true,  d(6)  should  be  close  to  zero  since  by 
construction  d(6)  =  0.   However,  Conniffe  (1987,  1988)  argued  that  a 
better  justification  comes  from  the  fact  that  E[d(6)]  =  0  at  the  true 
value  of  6  [see  also  Conniffe  (1990a)].   When  HQ  is  a  simple  hypothesis, 
then  there  is  no  problem.   However,  for  testing  a  composite  hypothesis, 
we  need  to  replace  some  components  of  0  by  their  estimates  in  d(8). 
Then  E[d(6)]  may  no  longer  be  zero.   In  that  case  Conniffe  suggested 
using 
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[d(6)  -  E{d(B)})'J(6)  [d(B)   -  E{d(B)}] 

where  J  is  the  inverse  of  the  variance  of  d(9)  -  E[d(6)]  and  he  called 

this  estimated  score  test.   For  an  example,  let  us  consider  again 

testing  for  serial  independence  in  linear  regression  model.   For 

simplicity,  we  assume  there  is  only  one  regressor  X.   In  (3.3),  we  noted 

that  the  standard  RS  test  utilizes  the  quantity  2  u.u. ...   However, 

i 


? 


Mi-i 


E,  xixi-i 


where  V(e-)  =  a2.   Therefore,  the  estimated  score  test  will  be  based 


-,  -,      ~7  E/  XiXi-i 

QlQi--L     +  5 


on 


E,*5 


Monte  Carlo  results  reported  in  Conniffe  (1990b)  show  that  estimated 
score  test  has  better  finite  sample  properties.   It  appears  that  some 
further  investigation  on  this  test  will  be  useful  in  econometrics. 

There  is  another  approach  to  get  the  size  of  RS  test  close  to  its 
preassigned  value  in  finite  sample.   In  this  case,  the  adjustment  is 
done  not  to  the  score  but  to  the  test  statistic.   Suppose  under 

Ho  :  6  =  6o'  E(RS1  "  Pf1  +  a(Q0)    +   0(l/n2)]«   Then  the  size  corrected  RS 
test  will  be 

RS'   =  RS/[1   +  a(6J] 
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Bartlett  (1937)  was  probably  the  first  to  discuss  an  adjustment  for 

testing  homogeneity  of  variance  in  the  context  of  LR  statistic.   For  RS 

test,  Harris  (1985)  suggested  an  adjustment  based  on  Edgeworth-type 

expansion.   Honda  (1989)  applied  Harris'  technique  to  RS  for 

homoskedasticity.   Dean  and  Lawless  (1989)  suggested  somewhat  different 

adjusted  score  tests  for  Poisson  model.   Monte  Carlo  results  of  Gurmu 

(1991)  indicate  that  Dean  and  Lawless  (1989)  size  correction  procedure 

is  effective  in  improving  the  small  sample  accuracy  of  RS  test.   Here  we 

should  note  that  these  modifications  have  been  done  only  on  the  basis  of 

first  moment,  and  it  is  not  clear  whether  they  would  improve  the 

approximation  to  the  upper  tail  [see  Cox  (1988)]. 

Earlier  we  discussed  the  White  (1982)  version  of  the  score  test 

when  the  model  is  estimated  by  QMLE.   His  version  of  the  test  is  valid 

even  if  the  model  is  misspecif ied.   This  kind  of  robustif ication  of  the 

score  test  has  been  found  to  be  very  useful  in  econometrics.   For 

example,  the  score  test  for  homoskedasticity,  as  suggested  by  Koenker 

(1981)  can  be  made  robust  by  replacing  2o*  in  the  test  statistic  by 

n"12(u2  -  a2)2.   Wooldridge  (1990),  following  the  approach  of  Davidson 

i 
and  MacKinnon  (1985),  suggested  a  general  procedure  for  robustif ying 

tests  for  a  specific  moment  condition  which  does  not  depend  on  the 

specification  of  some  higher  order  moments.   So  far  we  discussed  score 

test  based  on  likelihood  or  quasi-likelihood  functions.   However,  score 

type  test  has  wider  applicability.   The  estimation  could  be  based  on  a 

variety  of  techniques,  MLE,  method  of  moments,  minimum  chi-square  and 

robust  method  etc.   For  example,  a  general  estimating  equation  could  be 

S(6)  =  0.   When  S(6)  =  d£(Q)/dQ,    we  get  the  standard  score  test.   For 
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robust  estimation  S(6)  could  be  chosen  to  be  a  bounded  function  [see 
Koenker  (1982)].   Now  taking  S(6)  as  the  general  score  function,  we  can 
derive  score  type  tests  [for  details  see  Basawa  (1985)].   Neyman's  C(a) 
test  can  also  be  viewed  along  this  line.   For  testing  HQ  :  62  ■  62o  in 
the  presence  of  nuisance  parameter  8.,  the  estimating  equation  is 


dt(B+) 


-  i2l(e+)i;i{B+)*vp-  =  o 


ae2         21X    '-«*-  '    de1 

Robustness  of  a  test  can  also  be  judged  from  a  different  point  of 
view,  by  explicitly  parametrizing  the  true  model.   Let  us  assume  that 
the  true  model  is  represented  by  a  density  g(y;6,Y)  where 
g(y;6/0)  =  f(y;6)  which  is  our  assumed  model.   Suppose  we  test 
H  :  6  =  6  in  f(y;8)  then  a  natural  question  is  to  ask  how  can  we 
interpret  the  results  of  the  test  of  H  when  g(y;6,y)  is  true  density. 
Haavelmo  (1944)  called  f(y;6)  the  priori  admissible  hypothesis.   He  also 
noted  that  we  should  study  the  properties  of  our  test  under  certain 
alternatives  not  contained  in  our  priori  admissible  hypothesis,  since  it 
is  quite  possible  that  a  certain  outside  scheme  is  the  true  one  having 
serious  consequences  for  our  inference.   Davidson  and  MacKinnon  (1987) 
and  Saikkonen  (1989)  addressed  this  question  and  showed  that  RS  test  can 
have  substantial  power  even  when  H  is  true  if  y   f   0.   The  power  very 
much  depends  on  the  off-diagonal  element  of  the  information  matrix, 
namely  on 


I*   =  E 


dlng(y;Q,y)    .  dlng(y,Q,  yj 
ae  dy' 
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If  Iq  =  0,  then  the  presence  of  y   in  the  density  function 
asymptotically  does  not  affect  the  inference  on  0.   It  is  possible  to 
calculate  the  precise  affect  of  y,    and  using  that  Bera  and  Yoon  (1990) 
suggested  some  adjustment  to  the  RS  test  to  adapt  it  for  the  parameter 

Y- 

For  the  linear  model,  traditionally  the  score  test  has  been 
developed  under  the  assumption  of  normality,  and  then  it  is  studied 
whether  the  limit  distribution  is  invariant  to  a  wide  variety  of 
densities  for  the  error  e.   A  score  test  for  the  omitted  variable  z- 
from  the  regression  (3.1)  is  simply  the  test  for  E(z-e-)  =  0,  and  the 
test  statistic  is  computed  as  n  times  the  R2  from  the  regression  of  1.  on 
(x.e . ,  z-e  - )  .   If  the  error  is  not  normal  the  RS  test  will  be  the  test  for 
E(z-s-)  =  0,  where  s,  is  the  score  defined  as  the  ratio  of  the 
derivative  of  the  density  and  the  density  itself.   The  RS  statistic,  in 
this  case,  can  be  computed  as  nR  from  the  regression  of  1.  on  (x-§.,z-s.) 
where  s.  is  the  estimated  score.   Note  that  under  normality  Sj  =  -o~*e-, 
and  the  above  two  approaches  are  the  same.   Other  diagnostic  tests ^can 
similarly  be  developed  for  non-normal  cases.   For  these  and  estimation 
of  8^,  see  Pagan  and  Ullah  (1986,  ch.  6). 

A  result  very  often  cited  in  the  econometric  literature  is  that 
asymptotically  all  the  three  tests  LR,  W  and  RS  are  eguivalent  under  the 
null  and  local  alternatives.   This  result  is  based  on  first-order 
asymptotic  theory.   Rao  (1962)  conjectured  that  using  higher  order 
asymptotics,  RS  test  can  be  shown  to  be  locally  more  powerful  than  the 
LR  and  W  tests.   In  a  series  of  papers  Chandra  and  Joshi  (1983),  Chandra 
and  Mukherjee  (1984,  1985),  Chandra  and  Samanta  (1988)  and  Mukherjee 
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(1990)  proved  this  conjecture  for  various  cases  using  second  and  third 
order  asymptotics.   One  interesting  result  is  that  the  differences  in 
power  depend  on  Efron's  (1975)  curvature  at  6Q.   For  exponential  family 
statistical  curvature  is  zero  and  large  curvature  indicates  breakdown  of 
some  standard  results  in  estimation  and  testing.   Since  in  econometrics, 
we  very  often  deal  with  non-exponential  families,  the  above  result  has 
some  serious  implications.   Amari  (1985,  p.  200)  indicated  that  the  W 
test  might  do  well  powerwise  for  distant  alternatives  since  it  is  based 
on  estimates  under  the  alternative  hypothesis.   King's  (1988)  point 
optimal  test  can  be  viewed  in  this  light,  since  he  suggested  to  choose 
some  parameter  values  away  from  the  null  hypothesis  to  construct  a  test 
with  high  power.   Our  conclusions  will  be  the  same  as  in  Conniffe 
(1990a,  p.  105).   That  is,  the  score  test  is  the  most  powerful  for 
alternatives  close  to  the  null,  W  test  is  best  for  distant  alternatives 
while  the  LR  test  has  some  advantage  for  intermediate  alternatives. 

7.   EPILOGUE 
In  a  recent  interview  Professor  Rao  was  asked  about  his  favorite 
publications  among  his  many  books  and  papers  [see  DeGroot  (1987)].   Part 
of  Rao's  reply  was  ".  .  .  In  1947  ...  I  introduced  two  general 
asymptotic  test  criteria  called  score  tests  for  simple  and  composite 
hypotheses  as  alternative  to  Wald's  tests.   I  find  that  my  score  test 
for  composite  hypotheses  has  become  entrenched  in  the  econometrics 
literature  under  a  fancier  name,  the  Lagrange  Multiplier  Test.   So  those 
are  a  few  papers  which  I  like  and  which  have  received  some  attention." 
Many  of  the  techniques  that  are  currently  being  used  in  econometric 
theory  and  practice  have  their  origin  in  the  statistics  literature. 
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From  that  point  of  view,  Rao's  paper  must  be  one  of  the  most 

influential.   To  make  the  transition  from  statistical  theory  to 

applications,  econometricians  played  a  very  prominent  role.   Of  course, 

there  are  still  many  unsolved  problems.   Pagan  (1990,  p.  279)  summarizes 

the  current  state  rather  succinctly. 

How  to  design  a  test  is  now  well  understood,  and 
it  seems  unlikely  that  we  will  see  much  that  is 
new  in  this  area.   How  to  judge  it  remains  far 
more  uncertain,  and  ultimately  it  will  be  our 
response  to  this  challenge  that  will  decide  the 
utility  of  testing. 


AB.1-8 
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